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This article presents a new technique, adaptive replication, for automatically eliminating 
synchronization bottlenecks in multithreaded programs that perform atomic operations on 
objects. Synchronization bottlenecks occur when multiple threads attempt to concurrently 
update the same object. It is often possible to eliminate synchronization bottlenecks by 
replicating objects. Each thread can then update its own local replica without 
synchronization and without interacting with other threads. When ... 

Keywords: Atomic operations, commutativity analysis, parallel computing, parallelizing 
compilers, replication, synchronization 
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Publisher: IEEE Computer Society 

Full text available: ^Qpdfd 57.0 1 KB ) Additional Information: full cita ti o n, abstract, index terms 

Parallel and distributed simulations enable the analysis of complex systems by 
concurrently exploiting the aggregate computation power and memory of clusters of 
execution units. In this paper we investigate a new direction for increasing both the 
speedup of a simulation process and the utilization of computation and communication 
resources. Many simulation-based investigations require to collect independent 
observations for a correct and significant statistical analysis of results. The execution ... 
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LISP and functional programming LFP '94, volume vn issue 3 
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Full text available- pdf(919 87 KB) Additional Information: full citation, ab stract , references, citings, index 
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We have implemented a concurrent copying garbage collector that uses replicating 
garbage collection. In our design, the client can continuously access the heap during 
garbage collection. No low-level synchronization between the client and the garbage 
collector is required on individual object operations. The garbage collector replicates live 
heap objects and periodically synchronizes with the client to obtain the client's current 
root set and mutation log. An experimental implementation usi ... 



5 Fast det e ction of communication patterns in dis tr ib ut ed ex e cutions 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced 
Studies on Collaborative research CASCON '97 

Publisher: IBM Press 

Full text available: ^ | pdf(4.21 MB) Additional Information: full citation , abstract, references , index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based 
on process-time diagrams are often used to obtain a better understanding of the 
execution of the application. The visualization tool we use is Poet, an event tracer 
developed at the University of Waterloo. However, these diagrams are often very complex 
and do not provide the user with the desired overview of the application. In our 
experience, such tools display repeated occurrences of non-trivial commun ... 




Concu rre nc y and distribution in object-oriented programming 
Jean-Pierre Briot, Rachid Guerraoui, Klaus-Peter Lohr 
September 1998 ACM Computing Surveys (CSUR), volume 30 issue 3 
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Full text available- f* 1 ) odf(289 34 KB) Additional Information: fuJLcjMign, abstract, references, citings, index 
• [£|_h__a terms 

This paper aims at discussing and classifying the various ways in which the object 
* paradigm is used in concurrent and distributed contexts. We distinguish among the library 
approach, the integrative approach, and the reflective approach. The library approach 
applies object-oriented concepts, as they are, to structure concurrent and distributed 
systems through class libraries. The integrative approach consists of merging concepts 
such as obj ... 

Keywords: concurrency, distribution, integration, libraries, message passing, object, 
reflection 
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Additional Information: full citation , abstract , references , citings, index 
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Full text available: ^g|pdf(1.72 MB) terms 

Multithreaded processor architectures are becoming increasingly commonplace: many 
current and upcoming designs support chip multiprocessing, simultaneous multithreading, 
or both. While it is relatively straightforward to use these architectures to improve the 
throughput of a multithreaded or multiprogrammed workload, the real challenge is how to 
easily create parallel software to allow single programs to effectively exploit all of this raw 
performance potential. One promising technique fo ... 

Keywords: Thread-level speculation, automatic parallelization, cache coherence, chip- 
multiprocessing 



8 Very concurrent mark-&-sweep garbage collection without fine-grain synchronization 
Lorenz Huelsbergen, Phil Winterbottom 

October 1998 ACM SIGPLAN Notices , Proceedings of the 1st international symposium 

on Memory management ISMM '98, volume 34 issue 3 
Publisher: ACM Press 

Full text available' f 8 ! Ddf(1 36 MB) Additional Information: full citation, abstract, references, citi n gs, index 

ter m s 

We describe a new incremental algorithm for the concurrent reclamation of a program's 
allocated, yet unreachable, data. Our algorithm is a variant of mark-&-sweep 
collection that— unlike prior designs— runs mutator, marker, and sweeper threads 
concurrently without explicit fine-grain synchronization on shared-memory 
multiprocessors. A global, but infrequent, synchronization coordinates the per-object 
coloring marks used by the three threads; fine-grain synchronization is achieve ... 

Mode ls and languages for parallel computation 
David B. Skillicorn, Domenicp Talia 

June 1998 ACM Computing Surveys (CSUR), volume 30 issue 2 
Publisher: ACM Press 

Full text available: Q pdf(298.05 KB) Additional ""formation: MLJatbn, abstract, references, citings, index 

We survey parallel programming models and languages using six criteria to assess their 
suitability for realistic portable parallel programming. We argue that an ideal model 
should by easy to. program, should have a software development methodology, should be 
architecture-independent, should be easy to understand, should guarantee performance, 
and should provide accurate information about the cost of programs. These criteria reflect 
our belief that developments in parallelism must be driven b ... 

Keywords: general-purpose parallel computation, logic programming languages, object- 
oriented languages, parallel programming languages, parallel programming models, 
software development methods, taxonomy 



10 A parallel, incremental, mostly concurrent garbage collector for servers 

Katherine Barabash, Ori Ben-Yitzhak, Irit Goft, Elliot K. Kolodner, Victor Leikehman, Yoav 
Ossia, Avi Owshanko, Erez Petrank 

November 2005 ACM Transactions on Programming Languages and Systems 

(TOPLAS), Volume 27 Issue 6 
Publisher: ACM Press 

i- ii* i ui «<i ^/cooca^dv Additional Information: full citation, abstract, re ferenc es, citings, index 
Full text available: pdf( 683. 5 0 KB ) terms 

Multithreaded applications with multigigabyte heaps running on modern servers provide 
new challenges for garbage collection (GC). The challenges for "server-oriented" GC 



http://portal.acm.org/res^^ 8/10/2007 



Results (page 1): synchronization thread dynamic replication concurrent 



Page 4 of 7 



include: ensuring short pause times on a multigigabyte heap while minimizing throughput 
penalty, good scaling on multiprocessor hardware, and keeping the number of expensive 
multicycle fence instructions required by weak ordering to a minimum. We designed and 
implemented a collector facing these demands building on th ... 

Keywords: Garbage collection, JVM, concurrent garbage collection 



11 Sa pphire: copying GC without stoppin g the world j 
Richard L. Hudson, J. Eliot B. Moss 

June 2001 Proceedings of the 2001 joint ACM-ISCOPE conference on Java Grande JGI 
01 

Publisher: ACM Press 

Full text available- ffl pdf(899 45 KB) Additional Information: f ull cita tion, abstract, references, citings, index 

terms 

Many concurrent garbage collection (GC) algorithms have been devised, but few have 
been implemented and evaluated, particularly for the Java programming language. 
Sapphire is an algorithm we have devised for concurrent copying GC. Sapphire stresses 
minimizing the amount of time any given application thread may need to block to support 
the collector. In particular, Sapphire is intended to work well in the presence of a large 
number of application threads, on small- to medium-scale shared memor ... 

12 Concurrent g a rbage collection using hardware-assisted profiling | 

Timothy H. Heil, James E. Smith 

October 2000 ACM SIGPLAN Notices , Proceedings of the 2nd international 
symposium on Memory management ISMM 'OO, volume 36 issue i 
Publisher: ACM Press 

Full text available: ^| pdf(1.74 MB) Additional Information: full citation , abstract , citings , index terms 

In the presence of on-chip multithreading, a Virtual Machine (VM) implementation can 
readily take advantage of service threads for enhancing performance by performing tasks 
such as profile collection and analysis, dynamic optimization, and garbage collection - 
concurrently with program execution. In this context, a hardware-assisted profiling 
mechanism is proposed. The Relational Profiling Architecture (RPA) is designed from the 
top down, RPA is based on a relational model similar ... 

1 3 JR: Flexible distributed programming in an ex t ende d Java j 
Aaron W. Keen, Tingjian Ge, Justin T. Maris, Ronald A. Olsson 

May 2004 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 26 Issue 3 
Publisher: ACM Press 

Full text available: ^ pdf(1 67,00. KB) Additional Information: full citation, abstract, references, index terms 

Java provides a clean object-oriented programming model and allows for inherently 
system-independent programs. Unfortunately, Java has a limited concurrency model, 
providing only threads and remote method invocation (RMI).The JR programming 
language extends Java to provide a rich concurrency model, based on that of SR. JR 
provides dynamic remote virtual machine creation, dynamic remote object creation, 
remote method invocation, asynchronous communication, rendezvous, and dynamic 
process creation ... 

Keywords: Concurrency, Java, SR, concurrent. object-oriented programming 
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September 1989 ACM Computing Surveys (CSUR), Volume 21 issue 3 
Publisher: ACM Press 

Full text available- f\ pdf(9 62 MB) Additional Information: fulLdtatign, ab stract , references, citings, index 
' ^ : t er ms 

Concurrent logic languages are high-level programming languages for parallel and 
distributed systems that offer a wide range of both known and novel concurrent 
programming techniques. Being logic programming languages, they preserve many 
advantages of the abstract logic programming model, including the logical reading of 
programs and computations, the convenience of representing data structures with logical 
terms and manipulating them using unification, and the amenability to 
metaprogrammin ... 

15 Distributed sy st ems - programming and management: On remote proced ure ca ll 
Patricia Gomes Soares 

November 1992 Proceedings of the 1992 conference of the Centre for Advanced 

Studies on Collaborative research - Volume 2 CASCON '92 
Publisher: IBM Press 

Full text available: ^] pdf(4.5 2 M B ) Additional Information: full citation , abstract, references, citings 

The Remote Procedure Call (RPC) paradigm is reviewed. The concept is described, along 
with the backbone structure of the mechanisms that support it. An overview of works in 
supporting these mechanisms is discussed. Extensions to the paradigm that have been 
proposed to enlarge its suitability, are studied. The main contributions of this paper are a 
standard view and classification of RPC mechanisms according to different perspectives, 
and a snapshot of the paradigm in use today and of goals for t ... 

1 6 The devel o pment of the Eme r ald pr o g rammin g language 
Andrew P. Black, Norman C. Hutchinson, Eric Jul, Henry M. Levy 
June 2007 Proceedings of the third ACM SIGPLAN conference on History of 

programming languages HOPL III 
Publisher: ACM Press 

Full text available: ^[p df (1. 45 MB ) Additional Information: f ull citat i o n, abstract, references, index terms 

Emerald is an object-based programming language and system designed and 
implemented in the Department of Computer Science at the University of Washington in 
the early and mid-1980s. The goal of Emerald was to simplify the construction of 
distributed applications. This goal was reflected at every level of the system: its object 
structure, the programming language design, the compiler implementation, and the run- 
time support. 

This paper describes the origins of the Emerald group, the for ... 

Keywords: Eden, Emerald, Washington, abstract types, call-by-move, distributed 
programming, mobility, object mobility, object-oriented programming, polymorphism, 
remote object invocation, remote procedure call, type conformity 



17 Programming languages for distributed computing systems 
Henri E. Bal, Jennifer G. Steiner, Andrew S. Tanenbaum 
September 1989 ACM Computing Surveys (CSUR), volume 21 issue 3 

Publisher: ACM Press 

[— 1 1 , . 1 ui « auk cn k a o \ Additional Information: full citation, abstract, ref erences , citi ngs, index 
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When distributed systems first appeared, they were programmed in traditional sequential 
languages, usually with the addition of a few library procedures for sending and receiving 
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messages. As distributed applications became more commonplace and more 
sophisticated, this ad hoc approach became less satisfactory. Researchers all over the 
world began designing new programming languages specifically for implementing 
distributed applications. These languages and their history, their underlying pr ... 
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The IBM Blue Gene project has endeavored into the development of a cellular architecture 
computer with millions of concurrent threads of execution. One of the major challenges of 
this project is demonstrating that applications can successfully exploit this massive 
amount of parallelism. Starting from the sequential version of a well known molecular 
dynamics code, we developed a new application that exploits the multiple levels of 
parallelism. in the Blue Gene cellular architecture. We perform ... 

Keywords: Blue Gene, cellular architecture, massively parallel computing? molecular 
dynamics, performance evaluation 
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Recently, lightweight thread libraries have become a common entity to support concurrent 
programming on shared memory multiprocessors. However, the disparity between 
primitives offered by operating systems creates a challenge for those who wish to create 
portable lightweight thread packages. What should be the interface between the machine- 
independent and machine-dependent parts of the thread library? We have implemented a 
portable lightweight thread library on top of Unix on a KSR-1 supercomput ... 
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Many sequential applications are difficult to parallelize because of unpredictable control 
flow, indirect data access, and input-dependent parallelism. These difficulties led us to 
build a software system for behavior oriented parallelization (BOP), which allows a 
program to be parallelized based on partial information about program behavior, for 
example, a user reading just part of the source code, or a profiling tool examining merely 
one or few executions. 

The basis of BOP is program ... 
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